Creating a speech corpus with semi-spontaneous, parallel conversational and clear speech Tech Report: CSLU-11-003

نویسندگان

Alexander Kain

John-Paul Hosom

Sarah Hargus Ferguson

Brian Bush

چکیده

Our goal is to collect a speech corpus for the purpose of studying intelligibility and acoustic differences between the conversational and clear speech styles. The ideal corpus has the following properties: (1) speech has been produced spontaneously as part of a communicative interaction, as opposed to having been read to an imagined interlocutor; (2) entire identical utterances, or large parts of utterances, are available in both conversational and clear speaking styles, also known as parallel recordings; and (3) utterances comprehensively and systematically cover the space of prosodic and phonetic features. We call the spontaneous (i. e. non-read) elicitation of speech with highly anticipated content (established through a given task) semi-spontaneous. We now discuss these desirable properties in more detail.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Pronunciation variant analysis using speaking style parallel corpus

To improve the recognition accuracy for spontaneous conversational speech, we collected a corpus to study how spontaneous conversational speech differs from read style speech. The corpus consists of two parts: 1) spontaneous conversational speech and 2) read speech with the same word transcriptions as the conversational speech. In word and phone recognition experiments, it was confirmed that, f...

متن کامل

Connected Digit Recognition Experiments with the OGI Toolkit's Neural Network and HMM-Based Recognizers

This paper describes a series of experiments that compare different approaches to training a speakerindependent continuous-speech digit recognizer using the CSLU Toolkit. Comparisons are made between the Hidden Markov Model (HMM) and Neural Network (NN) approaches. In addition, a description of the CSLU Toolkit research environment is given. The CSLU Toolkit is a research and development softwa...

متن کامل

Quantitative Analysis of Pitch in Speech of Children with Neurodevelopmental Disorders

We analyzed the prosody of children with Autism Spectrum Disorder, Developmental Language Disorder, and typical development in conversational speech, using the CSLU ADOS speech corpus. We found several significant differences in the pitch characteristics of these diagnostic groups, and report automatic classification utilizing these features that are well above chance level. We show that the ch...

متن کامل

Construction of Chinese Segmented and POS-tagged Conversational Corpora and Their Evaluations on Spontaneous Speech Recognitions

The performance of a corpus-based language and speech processing system depends heavily on the quantity and quality of the training corpora. Although several famous Chinese corpora have been developed, most of them are mainly written text. Even for some existing corpora that contain spoken data, the quantity is insufficient and the domain is limited. In this paper, we describe the development o...

متن کامل

An undergraduate course on speech recognition based on the CSLU toolkit

This paper describes an undergraduate course in speech recognition, based on the CSLU Toolkit, which was taught at the Universidad de las Américas in Puebla, México. Throughout the course, laboratory assignments based on the toolkit guided students through the process of creating a recognizer, while in-class lectures consistently refereed to the architecture of the toolkit as a concrete example...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2011

Creating a speech corpus with semi-spontaneous, parallel conversational and clear speech Tech Report: CSLU-11-003

نویسندگان

چکیده

منابع مشابه

Pronunciation variant analysis using speaking style parallel corpus

Connected Digit Recognition Experiments with the OGI Toolkit's Neural Network and HMM-Based Recognizers

Quantitative Analysis of Pitch in Speech of Children with Neurodevelopmental Disorders

Construction of Chinese Segmented and POS-tagged Conversational Corpora and Their Evaluations on Spontaneous Speech Recognitions

An undergraduate course on speech recognition based on the CSLU toolkit

عنوان ژورنال:

اشتراک گذاری